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Abstract 

We show that a random walk on the mapping class group of an orientable surface of finite 
type makes linear progress in the relative metric, which is quasi-isometric to the complex of 
curves. 
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1 Introduction 

Let £ be an orientable surface of finite type, which is not a sphere with three or fewer punctures. 
The mapping class group G of £ is the group of orientation preserving diffeomorphisms of £, modulo 
those isotopic to the identity. Let /jbea probability distribution on G. We may use \x to generate 
a random walk on G, which is a Markov chain on G with transition probabilities p(x, y) = fi(x~ 1 y), 
and we will assume we start at the identity at time zero. The path space for the random walk is 
the probability space (G z+ , P), where the product G z+ is the collection of all sample paths, and the 
measure P is determined by /i. Let w n be the random variable corresponding to projection onto 
the n-th factor. So if to is a sample path, w n (uj) is the location of the path at time n, and the 
distribution of w„ is given by the n-fold convolution of fi with itself, which we shall write \x n - We 
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say that the random walk has a linear rate of escape if the limit I = lim„_ s . 00 ^\w n (ui)\ exists for 
almost all sample paths ui, and £ is strictly greater than zero almost surely. Here \g\ is the length of 
the group element g in the word metric on the group G. Kcsten [3EI and Day [2] showed that an 
irreducible random walk on a non-amenable group has a linear rate of escape, assuming the random 
walk has finite first moment, i.e. the expected value of the distance the random walk moves in 
one step is finite. The mapping class group contains non-abelian free subgroups, so in particular is 
non-amenable. Therefore a random walk on the mapping class group makes linear progress in the 
word metric on the group. 

Masur and Minsky [12j show that there is a relative metric on the mapping class group, under 
which the group is quasi-isomctric to the complex of curves. A relative metric is a word metric on 
the group with respect to an infinite generating set, consisting of a finite generating set, union a 
finite collection of subgroups. In this case, the collection of subgroups consists of stabilizers of simple 
closed curves a 2 ; , where the a>i consist of representatives for orbits of simple closed curves under the 
action of the mapping class group. The complex of curves is a simplicial complex whose vertices 
are isotopy classes of essential simple closed curves, and whose simplices are spanned by disjoint 
collections of simple closed curves. The quasi-isometry may be explicitly described by choosing a 
bascpoint xq for the complex of curves, and sending g to g(xo). We will write d(l,g) for the length 
of the group element g in the relative metric on G, and this is coarsely equivalent to the distance 
g moves the basepoint in the complex of curves. In this paper we show that a random walk in the 
mapping class group makes linear progress in the relative metric. 

Theorem 1.1. Let G be the mapping class group of an orientable surface of finite type, which is 
not a sphere with three or fewer punctures, and consider the random walk generated by a probability 
distribution fi whose support is bounded in the relative metric, which generates a non- elementary 
subgroup of the mapping class group and which has finite first moment. Then there is a constant 
£ > such that lim n _ ! . 0O -d(l, w n ) = £ almost surely. 

The relative metric is an improper metric on the mapping class group, and therefore not quasi- 
isometric to the word metric, so linear progress in the word metric does not immediately imply linear 
progress in the relative metric. Wc will begin with a brief discussion of the mapping class group of 
the torus, as in this case it is easy to see that the result holds. 

The mapping class group of the torus is isomorphic to the group 5L(2,Z). As there are no 
non-parallel disjoint essential simple closed curves on the torus, the curve complex for the torus 
is usually defined to have edges connecting pairs of curves that intersect exactly once. The curve 
complex for the torus is the Farey triangulation of the unit disc, and 5'L(2,Z) with its word metric 
is quasi-isometric to the dual graph, which is a trivalent tree. This is illustrated below in Figure [T] 
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Figure 1: The Farey triangulation. 

For simplicity, consider a non-backtracking random walk on the trivalcnt tree. Such a path may 
be described by a sequence LRLLRL. . . , where an L denotes a left turn at a vertex, and an R 
denotes a right turn at a vertex. Such a path makes uniform progress in the trivalent graph. The 
path travels distance one in the relative space whenever the next letter in the sequence is different 
from the previous one, and this occurs with probability one-half. So the random walk makes progress 
in the relative space on average at half the rate it makes progress in the trivalent tree. 

We now indicate the argument we use for more complicated mapping class groups. Imagine 
starting at a basepoint x m hyperbolic space, and travelling some large distance R. The set of 
points you may arrive at is given by a sphere centered at xq. Now travel another large distance S, 
as illustrated below in Figure [5] 




Figure 2: Spheres in hyperbolic space. 

Due to negative curvature, nearly all the volume of the second sphere of radius S lies outside the 
sphere of radius R. We wish to translate this intuition into our setting. Our space is the mapping 
class group G with a relative metric, which we shall denote G, which is quasi-isometric to the complex 
of curves. Masur and Minsky [T2] showed that this space is a (non-proper) (5-hyperbolic space, and 
Klarreich [TU] identified the Gromov boundary of the space as the space of foliations in VM.T which 
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contain no closed trajectories. Let fj, be a probability distribution on G, whose support generates 
a non-elementary subgroup. Kaimanovich and Masur showed that for the random walk determined 
by /i, almost all sample paths converge to uniquely ergodic, and hence minimal, foliations and this 
defines a harmonic measure v on the boundary, where v{X) is the probability that a sample path 
converges to a foliation contained in the set X. The harmonic measure v governs the long time 
behaviour of the sample paths, and is the weak-* limit of the ro-fold convolutions of [i on G union 
its boundary. In particular, if we start at the identity and consider all sample paths of length R, for 
R large, the distribution of endpoints looks similar to v, at least when viewed from the identity. If 
we now continue the random walk for another S steps, negative curvature leads us to expect that 
most of the new sample points will lie a definite distance further away from the origin. 

In order to make this intuition precise, we need some way of comparing the harmonic measure 
v with the convolution measures fi n . We will compare the measures on sets which are half spaces. 
A halfspace H(x,y) consists of all points closer (in G) to y than x, and we will be most interested 
in halfspaces H(l,x), where 1 is the identity element in G. In Section [5] we will show that the 
harmonic measure of a halfspace H(l,x) decays exponentially in d(l,x), i.e. there is a constant 
L < 1 such that v(H(l,x)) ^ L d ^ 1,x \ for all x sufficiently far from 1, where H(l,x) is the closure of 
H(l,x). Furthermore, we will show there is a constant Q such that fi n (H(l, x)) ^ QL d ( 1,x \ for all 
x sufficiently far from 1, and these estimates will allow us to relate the harmonic and convolution 
measures. We briefly indicate why one expect these estimates to hold. First observe that there 
is some number K such that the harmonic measure of all halfspaces H(l,x), with d(l,x) K is 
bounded away from 1, at most 1 — e, say. Then given a halfspace H(l,xi), where d(l,xi) is large, 
one may construct a nested sequence of half spaces H(l,x n ) D H(l,x n -x) D . . . D H{1,X\). The 
number of such halfspaces is linear in d(l,Xj), and furthermore, we may assume that the distance 
between any point in H(l, x{) and any point in the complement of H(l,Xi + i) is larger than K. The 
conditional probability that a sample path converges into H(l, x t ), given that it hits a point in the 
complement of H(l, Xj+i) is at most 1 — e, so the harmonic measure of the innermost halfspace is 
at most (1 — e) n . In order to bound /U n in terms of v, suppose a large amount of the mass of fi n is 
contained in H(l,Xi), then at most (1 — e)fj, n (H(l,Xi)) of the mass can escape back out in to the 
complement of H(l, Xj+i), so this gives the upper bound for /z n . 

We will use the estimates described above to find a positive lower bound on the expected extra 
distance from 1 obtained by taking an extra m steps, after a random walk of length n, for m 
sufficiently large, i.e. we will show E(d(l,w n + m ) — d(l,w n )) ^ S > 0. Therefore E(d(l,Wk m )) ^ kS, 
so the expected distance from the identity grows linearly, and a standard application of Kingman's 
subadditive ergodic theorem [5] shows that this will then be true for almost all sample paths as well. 

We remark that there are distance non-increasing maps to the relative space from other useful 
spaces on which the mapping class group acts, such as Teichmuller space and the pants complex, so 
linear progress in the complex of curves implies linear progress in these spaces too. In particular, 
work of Duchin [3], shows that linear progress in Teichmuller space implies that for almost all 
sample paths there is a geodesic which the random walk tracks sublinearly, at least for the parts of 
the geodesic in the thick part of Teichmuller space. 

In Section [2j we recall some standard definitions and set up some notation. In Section [3] we 
prove some useful results about halfspaces in non-proper (5-hyperbolic metric spaces. As we do not 
assume that fi is symmetric, it will be convenient for us to know that a semi-group in the mapping 
class group contains a pair of independent pseudo-Anosov elements, if and only if it generates a non- 
elementary subgroup, and we show this in Section [4j in a straight forward extension of some results 
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of Ivanov [5] . In Section [5] we show that the harmonic measure of halfspaces in the relative space 
decays exponentially in the distance of the halfspace from the basepoint, and obtain the estimate 
for the convolution measures. Finally in Section [5] we find a positive lower bound for the expected 
difference between d(l, w n+m ) and d(l, w n ), and then apply Kingman's subadditive crgodic theorem 
to show that a random walk makes linear progress in the relative space. 

1.1 Acknowledgements 
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NSF grant DMS-0706764. 

2 Preliminaries 

Let £ be an orientable surface of finite type, i.e. a surface of genus g with p marked points, 
usually referred to as punctures. The mapping class group G of £ consists of orientation preserving 
diffcomorphisms of £ which preserve the punctures, modulo those isotopic to the identity. For the 
purposes of this paper we shall assume that £ is not a sphere with three or fewer punctures. 

The collection of essential simple closed curves in the surface may be made in to a simplicial 
complex, called the the complex of curves, which we shall denote C(£). The vertices of this complex 
are isotopy classes of simple closed curves in £, and a collection of vertices spans a simplex if 
representatives of the curves can be realised disjointly in the surface. The complex of curves is a 
finite dimensional simplicial complex, but it is not locally finite. We will write Cq(£) to denote the 
vertices of the simplicial complex C(£), which is the set of isotopy classes of simple closed curves. 
We will write dc(x,y) for the distance in the one-skeleton between two vertices x and y of the 
complex of curves. We will always consider the complex of curves to have a basepoint xq, which we 
can take to be one of the curves corresponding to a standard generating set for the mapping class 
group. The mapping class group acts by simplicial isometries on the complex of curves. For certain 
sporadic surfaces the definition above produces a collection of disconnected points, and so a slightly 
different definition is used. If the surface is a torus with at most one puncture, then two vertices 
are connected by an edge if the corresponding simple closed curves may be isotoped to intersect 
transversely exactly once. If the surfaces is a four punctured sphere, then two vertices are connected 
by an edge if the corresponding simple closed curves may be isotoped to intersect transversely in 
two points. In both of these cases, the resulting curve complex is isomorphic to the Farey graph. 

A geodesic metric space is 5 -hyperbolic if every geodesic triangle is <5-slim, i.e. each edge is 
contained in a (^-neighbourhood of the other two. Masur and Minsky [TJ] have shown that the 
complex of curves is 5-hyperbolic. 

The mapping class group is finitely generated, so any choice of generating set A gives rise to a 
word metric on G, and any two different choices of finite generating set give quasi-isometric word 
metrics. Given a group G, and a collection of subgroups % = {Hi}i£i, we define the relative length of 
a group element g to be the length of the shortest word in the typically infinite generating set AWH. 
This defines a metric on G called the relative metric, which depends on the choice of subgroups %. 
We will write G to denote the group G with the relative metric, which we shall also refer to as the 
relative space. We say a finitely generated group G is weakly relatively hyperbolic, relative to a finite 
list of subgroups H, if the relative space G is S- hyperbolic. 
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We may consider the relative metric on the mapping class group with respect to the following 
collection of subgroups. Let {cti, . . . , a n } be a list of representatives of orbits of simple closed curves 
in S, under the action of the mapping class group. Let Hi = fix(aj) be the subgroup of G fixing 
ctj. Masur and Minsky [TJ] have shown that the resulting relative space is quasi- isometric to the 
complex of curves. As the complex of curves is J-hyperbolic, this shows that the mapping class group 
is weakly relatively hyperbolic. Klarreich [TU], see also Hamenstadt @], showed that the Gromov 
boundary of the complex of curves is the space J 7 m ini which we now describe. The space T m in 
consists of the subset of VM.T consisting of foliations which contain no closed trajectories, up to 
equivalence, where two measured foliations are equivalent if they are topologically equivalent. The 
topology on T m i n is the induced topology from VMT. 

We now review some background on random walks on groups, see for example Woess |14j . Let 
G be the mapping class group of an orientable surface of finite type, which is not a sphere with 
three or fewer punctures, and let fi be a probability distribution on G. We may use the probability 
distribution /i to generate a Markov chain, or random walk on G, with transition probabilities 
p(x, y) = n(x~ 1 y). We shall always assume that we start at time zero at the identity element of the 
group. The path space for the random walk is the probability space (G Z +,P), where G z+ is the set 
of all infinite sequences of elements G. We will write w n for the random variable corresponding to 
projection onto the n-th factor, which gives the position of the sample path at time n. The position 
of the random walk at time n may be described as the product mim 2 . . . m n , where the m, are the 
increments of the random walk, i.e. the are a sequence of independent /^-distributed random 
variables. Therefore the distribution of random walks at time n is given by the n-fold convolution of 
/i, which we shall write as \x n , and we shall write p^ n ' (x, y) for the probability that you go from x to y 
in n steps. The probability measure P is determined by fi n using the Kolmogorov extension theorem. 
The reflected random walk is the walk generated by the reflected measure Jl, where Jl(g) = ^(g^ 1 ). 
The Bernoulli shift in the space of increments of the random walk determines a measure-preserving 
crgodic transformation on (G Z+ ,P) determined by (Uw) n = w% w n +i. 

We shall always require that the group generated by the support of fi is non- elementary, which 
means that it contains a pair of pscudo-Anosov elements with distinct fixed points in V M. T , the 
space of projective measured foliations on the surface. We do not assume that the probability 
distribution /i is symmetric, so the group generated by the support of n may be strictly larger than 
the semi-group generated by the support of /i. Finally, we shall always assume that the probability 
distribution /i has finite first moment with respect to the word metric on G. 

In [TT] , we showed that it followed from results of Kaimanovich and Masur [UJ and Klarreich [TU] , 
that a sample path converges almost surely to a uniquely ergodic foliation in the Gromov boundary 
of the relative space. This gives a measure v on Fmin, known as harmonic measure. The harmonic 
measure v is /^-stationary, i.e. 

v(x) = Y,K9)v{g- 1 x). 

<?eG 

Theorem 2.1. [6l ll0i[TTj Consider a random walk on the mapping class group of an orientable 
surface of finite type, which is not a sphere with three or fewer punctures, determined by a probability 
distribution fi such that the group generated by the support of /i is non- elementary. Then a sample 
path {w„(uj)} converges to a uniquely ergodic foliation in the Gromov boundary Train of the relative 
space G almost surely, and the distribution of limit points on the boundary is given by a unique 
fx- stationary, non- atomic measure v on T m % n . 

We remark that the measure v is supported on the uniquely ergodic foliations, which are a subset 
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of VM.J-, so we may think of v as a measure on VMJ 7 , with zero weight on all the non-uniquely 
ergodic measures. 

3 Halfspaces 

In this section we give detailed proofs of various useful properties of halfspaces in a non-locally 
compact (5-hypcrbolic space. These properties are presumably well known for locally compact 8- 
hyperbolic spaces, but we provide complete proofs to verify that these properties hold in the non- 
locally compact case. For consistency with the other sections of this paper, we will denote our 
non-locally compact 5-hyperbolic metric space by G, and we will write d(x, y) for the distance in G 
between two points x and y. Furthermore, we will always have a distinguished base point, which we 
shall call 1. 

Two points a and & in a metric space define a halfspace H(a, b) consisting of all those points 
which are at least as close to b as to a, i.e. H(a,b) = {x G G d(x,b) ^ d(x,a)}. The main two 
results of this section are Propositions 13.91 and 13.101 which we now briefly describe. Proposition 13.91 
says if two halfspaces H(l,x) and H(l,y) are small and far apart, then any other small halfspace 
hits at most one of H(l,x) or H(l,y). The halfspaces H(l,x) and H(l,y) arc small if d(l,x) and 
d(l,y) are large, and they are far apart if the geodesic [x,y] passes close to the basepoint 1. In 
this case any other halfspace H(l,z) hits at most one of H(l,x) or H(l,y), as long as d(l,z) is 
sufficiently large. Proposition 13. 101 savs that given a halfspace H(l,x), we may choose a point y on 
a geodesic [1, x] close to x such that the halfspace H(l, x) is contained in the halfspace H(l, y). Wc 
also obtain specific bounds on how large the halfspace H(l, x) appears when viewed from any point 
in H(y, 1), and similar bounds on how large the halfspace H(y, 1) appears when viewed from any 
point in H(l, x). To be more precise, we show that for any point a in H(y, 1), the halfspace H(l, x) 
is contained in a halfspace H(a,b), with an explicit lower bound on d{a,b). Furthermore, for any 
point b in H(l,x), there is a halfspace H(b,a) such that H(y, 1) C H(b, a), again, with an explicit 
lower bound on d(a, b). 

We begin with some elementary observations about nearest point projections. In a <5-hyperbolic 
space nearest point projections onto quasi-convex sets are coarsely well defined. We now show that 
for any set X, the K- neighbour hoods of X, and a particular choice of shortest path to X, have 
bounded intersection. 

Proposition 3.1. Let X be a set, and let [z,p] be a minimal length geodesic from a point z to X. 
Then the intersection of the K -neighbourhoods of X and [z,p] is contained in a 3 K -neighbourhood 
ofp- 
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Figure 3: A shortest path from z to X. 

Proof. Choose a point q in the intersection of Nk{X) and Nk([z,p]). There are points r € X and 
s G [z,p] such that d(r,q) ^ K and d(s,q) < K, as illustrated above in Figure |3] If d(p,q) > 3K, 
then as d(s, q) ^ K, this implies that d(p, s) > 2K. Therefore r is a closer point on [x, y] to s, and 
hence to z, than p, which contradicts the fact that p is a closest point on X to z. So d(p, q) 3/v . 
and so N K (X) n Afc C N 3K (p), as required. □ 

This shows that, up to additive error, the shortest way to get from a point z to a point x on a 
geodesic, is to head to the closest point to z on the geodesic, and then run along the geodesic to x. 
We will make extensive use of this fact, so we record it here as a proposition. 

Proposition 3.2. Let [x,y] be a geodesic from x to y, and p a closest point on [x,y] to z. Any 
geodesic [x,z] from x to z intersects a 36 -neighbourhood of p, so [x,p] U \p, z] is contained in a 
3(5 -neighbourhood of [x, z\. In particular G?(x,p) + d(p, z) — 66 ^ e?(x, z) ^ d(x,p) + d{p, z). 

Proof. The right hand inequality is just the triangle inequality. We now justify the left hand in- 
equality. By thin triangles, any geodesic [x, z] from x to z is contained in a ^-neighbourhood of the 
union of geodesies [x,p] and [p, z]. If [p, x] and [p, z] had long initial segments that fellow travelled, 
then the distance from x to z might be much shorter than the sum of the distances from x to p and 
from z to p. However, this would contradict the fact that p was a closest point on [x, y] to z. To be 
precise, Proposition 13.11 implies that the intersection of the 6- neighbourhoods of [x,p] and [p,z] is 
contained in a 3(5-neighbourhood of p. This means that any geodesic from z to x must pass within 
36 of p, so d(z,p) + d{p, x) — 66 ^ d(z, x), as required. □ 

An immediate consequence of Proposition 13.21 is the following quantitative version of the fact 
that nearest point projection is coarsely well defined. 

Proposition 3.3. Let p and q be nearest points to z on a geodesic [x, y] . Then d(p, q) 66. 

Let a and b have nearest point projections p and q onto a geodesic [x, y]. We now show that if 
d(p,q) > 146 apart, then, up to additive error, the geodesic from a to b goes from a to p, then runs 
along the geodesic from p to q, and then heads back out to b. 

Proposition 3.4. Let [x,y] be a geodesic and let p be a closest point on [x,y] to a, and let q be a 
closest point on [x, y] to b. If d(p, q) > 146 then d(a, b) ^ d(a,p) + d(p, q) + d(q, b) — 246. 
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Proof. Let [x, y] be a geodesic from x to y. Let [a,p] be a minimal length geodesic from a to [x,y], 
and let [6, q] be a minimal length geodesic from b to [x, J/]. First we show that N2s([a,p\) and 
N2s([b,q]) are disjoint. Suppose not, then let r be a point in A^2a([a,p]) n N25 ( [6, g] ) . Then there 
are points s <G [a,p] and t £ such that d(s,r) ^ 25 and d(t,r) ^ 25, as illustrated in Figured 
below. 



x p q y 




Figure 4: Shortest paths from a and b to [a;, y\. 

By the triangle inequality, d(t,p) ^ s) + d(s,p). As c2(t, s) ^ 45, this shows that 

d(t,p) ^45 + d(s,p). (3.1) 
By Proposition the path from t to p via g is almost a geodesic, i.e. 

d(t,q) + d{q,p) -66 ^d(t,p). (3.2) 
Combining (|3.1[) and (|3.2p , we obtain 

d(t, q) + d{q, p) - 65 d(s,p) + 45. (3.3) 

As [a,p] is a minimal length geodesic from a to [a;, y], it is also a minimal length geodesic from any 
point on [a,p] to [x, y]. Therefore p is a closest point on [x, y] to s, so the distance from s to p is 
less than or equal to the distance from s to [x,y\ by a path through t. This implies that 

d(s,p) ^ A5 + d{t,q). (3.4) 

Combining (|3.3[) and p.4[) , and subtracting d(t,q) from both sides, implies that d(p,q) 145. 
However we assumed that d(p,q) > 145, so this implies that N2s{[a,p\) and A^Qfr, q}) are in fact 
disjoint. 

By thin triangles, any geodesic [a, b] is contained in 25-ncighbourhood of [a,p] U [p, q] U [q, b]. By 
Proposition l3.11 the intersection of N 2 s([a,p}) with N2s([x,y]) is contained in a 65-neighbourhood of 
p. Similarly, the intersection of N2&([b, q\) with N2s([x, y]) is contained in a 65- neighbourhood of q. 
As the remaining parts of the 25-neighbourhoods outside Nq$(p) and Nss(q) are disjoint, this means 
that d(a, b) ^ d(a,p) + d(p, q) + d(q, b) — 245, as required. □ 
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We now show that nearest point projection to a connected subgeodesic of a geodesic is coarsely 
equivalent to nearest point projection to the original geodesic, followed by nearest point projection 
to the subgeodesic. 

Proposition 3.5. Let p\ be nearest 'point projection onto a geodesic [a,b], and let p2 be nearest 
point projection onto a subgeodesic [c,d] C [a, b]. Then there is a constant K\, which only depends 
on 5, such that d(p2(x), P2{p±(x))) ^ K±, for any point x. 

Proof. Let p be the nearest point projection of x to [a, b], i.e. p = pi(x). If q is another point on 
[a, b], then the path from x to q via p is almost a geodesic, Proposition l3.2[ so d(x,p) + d(p, q) — 65 ^ 
d(x, q) ^ d{x,p) + d(p, q). Therefore, if p lies in the subgeodesic [c, d], then ^(p) = p, and any point 
in [c, d] further than 65 from p is further away from x than p. On the other hand, if p lies outside 
[c, d], then p2(p) is equal to one of the endpoints, which we may assume is c, up to relabelling. Again, 
Proposition 13.41 implies that any point on [c, d] more than 65 away from c is further away from x 
than c. So we may choose K\ to be 75. □ 

We now show that that if a path 7 lies in a bounded neighbourhood of a geodesic, then the 
nearest point projections of any point to the path and to the geodesic are a bounded distance apart. 

Proposition 3.6. Let [x,y] be a geodesic, and let 7 be any path from x to y contained in a K- 
neighbourhood of [x, y\. For any point z, let p be a closest point on [x, y] to z, and let q be a closest 
point on 7 to z. Then d(p, q) ^ 3K + 65. 

Proof. Let p be the closest point to z on [a;, j/]. As 7 is a path from [x,y] contained in a K- 
neighbourhood of [x, y], there is a point p' in 7 such that d(p,p') ^ K, and hence d(p',z) ^ 
d(z,p) + K . Let q be the closest point on 7 to z, and let q' be the closest point on [x, y] to q, so the 
distance from d(q,q') ^ K. This is illustrated below in Figure [5] 




Figure 5: A path close to a geodesic [x, y]. 

By Proposition 13.21 the path from z to q' via p is almost a geodesic, so d(z,q') > d(z,p) + 
d(p, q') — 65, which implies that d(z, q) > d(z,p) + d(p, q')—65 — K. Therefore if d(p, q') > 2K + 65, 
then d(z,q) > d(z,p r ), which contradicts our choice of q as a closest point on 7 to z. Therefore 
d(p, q) < 3K + 65, as required. □ 

Recall that two points x and y define a halfspace H(x, y). We now show that the image of the 
halfspace under the nearest point projection to a geodesic [x, y] between x and y is contained in a 
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bounded neighbourhood of the half-segment of [x, y] closest to y. As a partial converse, we show 
that if the nearest point projection of z lies sufficiently close to y, then z € H(x,y). 

Proposition 3.7. Let z G H(x,y), and let p be the nearest point to z on a geodesic [x,y]. Then 
d(y,p) ^ \d{x, y) + 36. Conversely, if d(y,p) ^ \d{x, y) — 38, then z 6 H(x, y). 

Proof. Let z be a point in the halfspace H(x,y), let [x, y] be a geodesic from x to y, and let p be a 
closest point on [x,y] to z. This is illustrated below in Figure |6] 







z 


IE 






H(x,y) 





V 



Figure 6: A point z in the halfspace H{x, y). 

The point z lies in the halfspace H[x,y), so d(z,y) By the triangle inequality, the 

distance from z to x is less than or equal to the distance from z to x via p, which implies 

d(z,y) ^ d(z,p) +d(p,x). 

By Proposition ^. 21 the path from z to y via p is close to being a geodesic, i.e. d(z,p)+d(p, y) — 6S^ 
d(z,y). 

d(z,p) + d{p, y) - 65 ^ d(z,p) + d(p, x) 

We may subtract d(z,p) from both sides. As p lies on the geodesic [x, y], the distance from x to y is 
equal to the distance from x to p plus the distance from p to j/, i.e. d(x,y) = d(x,p) + d(p,y). This 
gives the required inequality, d(p,y) ^ ^d(x,y) + 36. 

For the converse, note that if z does not lie in the halfspace H(x, y), then z lies in the halfspace 
H(y, x), so d(p, x) ^ ^d(x, y)+35. The point p lies on the geodesic [x,y], so d(p,x) — d(x,y)~d(p,y), 
which implies ^d(x,y) ^ d(p,y) + 35. Therefore, if d(y,p) ^ i<i(x,?/) — 3<5 then this implies z 6 
H(x,y), as required. □ 

We have shown that the image of the nearest point projection of a halfspace H(x,y) to [x,y] is 
close to being half of this geodesic segment. If a geodesic segment [x, z] fellow travels with [x, y] for 
a sufficiently large initial segment, then we can estimate the image of the nearest point projection 
of H{x,y) onto [x, z]. This will be the case as long as the nearest point projection of z to [x,y] is 
sufficiently far from x. We now make this precise in the following proposition. 

Proposition 3.8. There are constants K2 and K3, which only depend on 6, such that if z has 
nearest point projection p to a geodesic [x,y], and d(p,x) ^ ^d(x, y) + K2, then the nearest point 
projection of H(y,x) to [x,z] is distance at most ^d(x,y) + K3 from x. 
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Proof. We shall choose K 2 to be 275 and K 3 to be 185. Let H (y, x) be the halfspace defined by the 
pair of points x and y, and let [x,y] be a geodesic from x to y. Let p be a nearest point to z on 
[a:, j/]. Let a e H(y,x), and let g be the closest point on [x,y] to a. This is illustrated in Figure [7] 
below. 
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Figure 7: Closest point projection of H(y,x) onto [x,z]. 

By Proposition 13.71 nearest point projection maps the halfspace H(y,x) to roughly half the 
geodesic [x,y]. This implies that d(q,x) ^ ^d(x,y) + 35, so d(q,p) ^ K 2 — 35. Let t be a point on 
\p,z], then as K 2 ^ 175, Proposition 13.41 implies that d(a,t) > d(a,q) + d(q,p) + d(p,t) — 245. As 
we chose K 2 to be 275, this implies d(p, q) ^ 245, and so the closest point to a on [x,p] U [p, z] lies 
on [x,p\. As [x,p] C [x, y], Proposition 13.71 implies that the nearest point projection of H{y,x) to 
[x,p] U [p, z] is distance at most ^d(x, y) + 35 from x. 

Let r be the nearest point to a on the geodesic [x, z\, and let s be the nearest point to a on the 
path [x,p] U [p, z\. By Proposition [321 the path from x to z via p is contained in a 35-neighbourhood 
of a geodesic [x, z], so as nearest point projections to close paths are close, Proposition 13.61 this 
implies d(r, s) ^ 3(35) + 65 = 155. Therefore the nearest point projection of H (y, x) to [x, z] is 
distance at most ^d(x,y) + 185 from x, and so we may choose K3 to be 185. □ 

We now show that if H(l,x) and H(l,y) are two halfspaces far from the origin, such that the 
geodesic from [x, y] passes close to the origin, then any other halfspace far from the origin hits at 
most one of H(l,x) or H(l,y). 

Proposition 3.9. There are constants and K§, which only depend on 5, such that for any 
geodesic [x, y] with d(l, x) and d(l, y) at least 2d(l, [x, y]) + K4, and for any point z with d(l, z) 
2d(l, [x, y]) + K5, the halfspace H(l, z) hits at most one of the halfspaces H(l, x) and H(l, y). 

We remark that the hypotheses may be restated in terms of the Gromov product, (x\y) = 
7}(d(l,x) + d(l,y) — d(x,y)), as in a 5-hyperbolic space the distance from the basepoint 1 to a 
geodesic [x, y] is coarsely equivalent to the Gromov product of x and y. 

Proof. We shall choose K4 to be 2K 2 + K 3 + 425, and K 5 to be 245. Let [x, y] be a geodesic from 
x to y, and let p be a closest point on [x, y] to 1. It will be convenient to use the following form of 
our assumption that the distance from 1 to a; is larger than twice the distance from 1 to p. 

\d{l,x)>d{l,p) + \K i (3.5) 
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Let r be the nearest point on [l,x] to y, as illustrated below in Figure [5] We start by showing 
that r is a bounded distance from p. By Proposition 13.21 the path from y to 1 via p is almost a 
geodesic, so if t is a point on [l,p] then d(t,y) > d(y,p) + d(p,t) — 66. This implies that the closest 
point projection of y to [x,p] U [p, 1] is distance at most 66 from p. As the path [l,p] U \p,x] lies 
in a 3<5- neighbourhood of the geodesic [l,x], as nearest point projections to close paths are close, 
Proposition 13. 61 this implies that 

d(r,p) < 3(36) + 66 + 66 = 216. (3.6) 




Figure 8: Disjoint halfspaces. 

In order to use Proposition ^. 8l to estimate the size of the nearest point projection of the halfspace 
H(l,x) to the geodesic [x, y] we need to show that that r is closer to 1 than x. As r lies on a geodesic 
from 1 to x, 

d(x, r) = d(l, x) — d(l, r). 
Using our initial assumption, line p.5j) . we may rewrite this as 

d(x,r) > ±d(l,x) + d(l,p) - d{l,r) + \K A . 
Using the bound for d(r,p) from line p.6j) and the triangle inequality, we obtain 

d(x,r) ^ %d(l,x) + \K A - 216. 

Therefore we may apply Proposition l3.81 as we chose K4 ^ 2K2 + 426. This implies that the nearest 
point projection of H(l,x) to [x,y] is contained in a -neighbourhood of x. 

We now show that the nearest point projection of H(l,x) to [x,y] is a definite distance away 
from p. Let s be a point in the nearest point projection of H(l, x) to [a;, y], so we have just shown 
that d(s,x) ^ 7}d(l,x) + A'3. As s lies on the geodesic [x,p], this implies 

d(s,p) ^ d(x,p) — — A 3 . 

Using the triangle inequality applied to going from 1 to i via p, we obtain 

d(s,p) ^ d{l,x) -d{l,p) - ±d(l,x)-K 3 . 
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Our initial assumption (|3.5[) now implies that d(s,p) ^ K4 — K3, which is greater than zero if 
K4 > K 3 . In fact d(s,p) ^ 145, as we chose K 4 to be 2K 2 + K 3 + 425. 

Our hypotheses are symmetric in x and y, so this also implies identical results for the image of 
the nearest point projection of H(l,y) to [x, y\. In particular, this implies that the nearest point 
projections of H(l,x) and H(l,y) to the geodesic [x,y] are disjoint. 

We now consider a halfspace H(l,z) which intersects both H(l,x) and H(l,y), and show there 
is an upper bound on d(l,z). Assume that H(l,z) intersects both H(l,x) and H(l,y). Let q be 
a closest point on [x,y] to z. Up to relabelling x and y, we may assume that q G \p,y]. This is 
illustrated in Figure [9] below. 
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Figure 9: Three halfspaces. 
Let a be a point lying in the intersection of H(l, x) and H(l, z), so in particular 

d(a,z) < d(l,a). (3.7) 

Let s be a nearest point on [x,y] to a. We showed above that d(s,p) > 145, and therefore d(s,q) > 
145. As s and q are the nearest point projections of a and z onto the geodesic [x, y\, and d{s, q) > 145, 
Proposition 13.41 implies that the path from a to z via s and g is close to being a geodesic, i.e. 
d(a, z) ^ d(a, s) + d(s,p) + d(p 7 q) + d(q, z) — 245. Together with line (|3.7[) , this implies 

d(o, s) +d(s,p) + d(p,q) + d(q,z) - 245 < d(l,a). 
By the triangle inequality, d(l, a) ^ d(l,p) + d(p, s) + d(s, a), which gives 

d(a,s) + d(s,p) + d{p,q) + d{q,z) - 245 < d(l,p) + d(p,s) + d(s,a). 
Adding d(l,p) — d(a, s) — d(s,p) to both sides, we obtain 

d(l,p) + d(p,q) + d{q, z) < 2d(l,p) + 245. 

Using the triangle inequality, this shows that d(l, z) ^ 2d(l,p) + 245. So we may choose K$ to be 
245. □ 
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We now show that if H(l, x) is a halfspace far from the origin, then it is contained in a halfspace 
H(l,y), with y a bounded distance closer to the origin, and we find an explicit lower bound on the 
distance between H(l,x) and H(y, 1). Furthermore, we show that for any point a in H(y, 1), there 
is a point b such that the halfspace H{a, b) contains H(l, x), with an explicit lower bound on d{a, b). 
Similarly, we show that for any point b in H{l,x), there is a point a such that H(y, 1) C H(b,a), 
again with an explicit lower bound on d(a, b). 

Proposition 3.10. There are constants Kq and K 7 , which only depend on 5, such that for any 
positive number A, and any point x with d(l,x) Kq + 4 A, there is a point y on [l,x] with 
d(l,y) = d(l,x) — 2Kj — 2A, such that H(l,x) C H(l,y). Furthermore, for any point a in H(y, 1) 
there is a point b in H(l,x), with d(a,b) 2A, and H(l,x) C H(a,b). Also, for any point b in 
H(l, x), there is a point a in H(y, 1) with d{a, b) 2A, and H(y, 1) C H (b, a). 

Proof. We shall choose K 7 to be 985 + 2K X and K 6 to be 2K 7 + 65. We first show that H(l, x) C 
H(l, y). Let p be a point in the nearest point projection of H(y, 1) to [1, x], and let q be a point in 
the nearest point projection of H(l,x) to [l,x]. Applying Proposition 13.71 to the halfspace H(l,y) 
gives 

d(l,p) ^ \d{l,y) + 2,5. 
Using our assumption that d(l,y) = d(l,x) — 2K 7 — 2A implies 

d(l,p)^±d(l,x) + 38-A-K 7 . (3.8) 

Now applying Proposition 13 . 71 to the halfspace H(l,x), we obtain 

d(q, 1) ^ \d{\, x) - 35. (3.9) 

Comparing (|3.8p and (|3.9p shows that if K 7 ^ 135, then d(p,q) ^ 75, and so H(l,x) and H{y, 1) 
are disjoint, which implies H(l,x) C H(l,y), as required. 
We now prove the second statement in Proposition ^. 101 

Claim 3.11. For any point a in H{y,\) there is a point b in H(l,x), with d(a,b) ^ 2A, and 
H(l,x) C H(a,b). 

Proof. Given a point a in H(y, 1) we will choose a point b in the geodesic [1, x], sufficiently far from 
a, and then show that H(l,x) C H{a,b) by showing that the nearest point projections of H(l,x) 
and H(b,a) to [l,x] are sufficiently far apart. 

Let a be a point in the halfspace H(y, 1), let p be a nearest point projection of a to [1, x] and let 
b be a point on the geodesic [l,x] distance d(l,p) + K 7 from x. This is illustrated below in Figure 

ma 
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Figure 10: The halfspace H(l,x) is contained in the halfspace H(a,b). 

We first find a lower bound for the distance between a and b. Using Proposition ^. 21 which says 
that the path [a,p] U [p, b] is contained in a 35-ncighbourhood of [a, b], and the triangle inequality, 
we obtain 

d(a,b) ^ d(p,b) - Q5. (3.10) 

Let z be the midpoint of [1, x]. Then d(p, z) ^ K7+A— 3(5, by Proposition ^. 71 and our assumption on 
the distance between the halfspaces H(l, x) and H(l, y). Our choice of b then implies d(z, b) A— 35. 
Therefore d(p, b) ^ K7 + 2A — 125, and this is at least 2A, as required, as we chose K7 ^ 128. 

We now show that the halfspace H(l, x) is contained in the halfspace H (a, b). As nearest point 
projection is coarsely well-defined, Proposition 13.31 it suffices to show that the nearest point pro- 
jections of H(l,x) and H(b,a) to [l,x] are distance at least 75 apart. By Proposition 13.71 if r is a 
point in the nearest point projection of H(l,x) to [l,x] then 

d(r,x)^±d(l,x) + 36. (3.11) 

Similarly, the nearest point projection of H (b, a) to [a, b] is distance at least ^d(a, b) — 35 from 
b. Let s be a closest point on [a, b] to p. By Proposition 13.51 if r is a point in the nearest point 
projection of H(b, a) to the subgeodesic [s, b] of [a, b], then 

d(r,b) ^ \d{a,b) -35- K x . 

By Proposition 13. 2\ d(s,p) 35, and the path [s,p] U [p, b] is contained in a 3i5-neighbourhood of 
[s,b]. Therefore, as nearest point projections to close paths are close, Proposition 13.6) this implies 
that if r is a nearest point in [s,p] U [p, b) to H (b, a) then 

d(r,b) ^ \d{a,b) - 185 - K x . 

Nearest point projection onto the path [s,p] U [p, b] is the same as nearest point projection onto the 
path \p, s] U [s,p] U [p, b], and this latter path is contained in a 35-neighbourhood of [p, b], so again 
applying Proposition 13.61 if r lies in the nearest point projection of H(b, a) to [p, b] then 

d(r,b) ^ \d{a,b) - 335 -Ky. 
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Using (|3.10[) . and the fact that [p, b] is a geodesic subsegment of [l,x], we obtain 

d{r,b) ^ §d(l,x) - \d{l,p) - \d(b,x) - 36<5 - K x . 
The points r and b lie on the geodesic [1, a;], and d(b, x) = d(l,p) + K 7 from x, so this implies 

d{r,x) > ±d(l,x) + ±A' 7 - 36(5 - (3.12) 

Comparing p. lip with fl3.12[) shows that the distance between the projections of the two halfspaces 
to [l,x] is at least 75, as K 7 ^ 925 + 2K\. Therefore the nearest point projections of H(l,x) and 
H(b, a) are sufficiently far apart, and so H(l, x) C H(a, b), as required. □ 

We now prove the hnal statement, by an analogous argument to the one above, though unfortu- 
nately not exactly the same, as the picture is not completely symmetric. 

Claim 3.12. For any point b in H(l,x), there is a point a in H(y,l) with d(a,b) ^ 2A, and 
H{y,l)cH(b,a). 

Proof. Let b be a point in the halfspace H(l, x), and let p be a nearest point projection of b to [1, a;]. 
Let a be a point on the geodesic [l,x] distance d(p,x) — 2 A — Kj from 1, and this is greater than 
zero, as d(l, x) AA + 2Kj + 66. This is illustrated below in Figure [TTJ 
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Figure 11: The halfspace H(y, 1) is contained in the halfspace H(b, a). 

We start by showing that d(a, b) 2 A. Using Proposition 13. 21 which says that the path [a,p] U 
[p, b] is contained in a 3(5- neighbourhood of [a, b], and the triangle inequality, we obtain 

d(a,b)^d(a,p)-65. (3.13) 

Proposition I3.7[ d(p,x) ^ hd(l,x) + 3(5, and our choice of a, implies that d(l,a) ^ hd(l,x) + 3(5 — 
2A - K 7 . Therefore d(a,p) ^2A + K 7 - 66, and so d(a, b) ^ 2A as K 7 ^ 126. 

We wish to show that H{y,l) C H(b,a), and as before it suffices to show that the nearest 
point projections of the halfspaces H(y,l) and H(a,b) to the geodesic [l,x] are distance at least 
75 apart, as nearest point projection is coarsely well defined, Proposition 13.31 Let r be a point 
in the nearest point projection of H(y,l) to [l,x] which is furthest from 1. By Proposition 13. 7\ 
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d(l,r) ^ |d(l,j/) + 35. Also, as d(H(y,l),H(l,x)) ^ A + K-j, again using Proposition 13.71 this 
implies d(x, y)~^2A + 2K 7 + 65. Therefore 

d(l,r) ^±d(l,x)-A-K 7 + 65. (3.14) 

Now applying Proposition 13.71 to the halfspace H{a 1 b), implies that if r is a point in the nearest 
point projection of H(a, b) to [a, b], then 

d(a, r) irf(a, b) — 35. 

Let s be a closest point on [a, 6] to p. As [a, s] is a subgeodesic of [a, b], we may use Proposition ^. 5[ 
which implies that if r is a point in the nearest point projection of H(a, b) to [a, s], then 

d(a,r) > |d(a,6)-35- A'i. 

By Proposition 13.21 d(s,p) ^ 3(5, and the path [a,p] U [p,s] is contained in a 35-neighbourhood of 
[a, s]. Therefore, as nearest point projections to close paths are close, Proposition 13.61 this implies 
that if r is a nearest point in [a,p] U [p, s] to H (a, b) then 

d{a,r) ^ \d{a, b) - 185 - K t . 

Nearest point projection onto the path [a,p] U \p, s] is the same as nearest point projection onto the 
path [a,p] U [p,s] U [s,p], and this latter path is contained in a 35-neighbourhood of [a,p], so again 
applying Proposition 13.61 if r lies in the nearest point projection of H(a, b) to [a,p] then 

d(a,r) > ±d(a,6)-335- K x . 
Using (|3.13[) . and the fact that [a,p] is a geodesic subsegment of [l,x], we obtain 

d(l,r) > 5^(1, p) + |d(l,a) - 365- A x . 
As we chose a such that d(l, a) — d(p, x) — 2A — K 7 , this shows 

d(l,r) ^ \d{l,x) -A- \K 7 -3&8-K 1 . (3.15) 

Comparing p,15p with (|3.14[) shows that the distance between the projections of the two halfspaces 
to [l,x] is at least 75, as K 7 > 985 + 2K\. Therefore the nearest point projections of H(y, 1) and 
H(a,b) are sufficiently far apart, and so H(y, 1) C H(b,a), as required. □ 

This completes the proof of Proposition ^. 101 □ 

Finally, we prove a result about a slightly more general definition of halfspaces. Let H(l, x; C) = 
{y I d(y, x) ^ d(l, x) + C}. We allow C to be negative, and if C is zero, this recovers the standard 
definition of a halfspace, and we shall continue to write H(l,x) to mean iJ(l,x;0). We now show 
that a coarse halfspace is contained in a halfspace. 
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Proposition 3.13. The halfspace H(l,x;C) is contained in the halfspace H(l,y), for d(l,y) = 
d(l,x)-C-96, if < C < d(l, x) - 96. 

Proof. Let z G H(l, x; C), and let p be a nearest point to z in [1, x], so d(z, x) ^ d(z, 1) + C. By the 
triangle inequality, d(l, z) ^ d(\,p) + d(p, z), therefore 

d{z, x) s$ d(l,p) + d(p, z) + C. 

The path from z to x via p is almost a geodesic, Proposition ^. 2[ so d(z, x) ^ d(z,p) + d(p, x) — 66. 
This implies 

d(p,x) < d(l,p) + C + 65. 

As p lies on the geodesic [1, x], we may rewrite this as 

d{p,x) < id(l,x) + |C + 35. 

By Proposition 13.71 this implies that z is contained in the halfspace H{l,y), for d(l,y) = d(l,x) — 
C — 96, as required. □ 

4 Non-elementary semi-groups 

We do not assume that our random walk is symmetric, so it will be convenient to know that if 
the group generated by the support of fj, is non-elementary, then the semi-group generated by the 
support of fi contains a pair of independent pseudo-Anosov elements. This follows from well-known 
results on the structure of subgroups of the mapping class group, which we now briefly review. We 
will use the definitions and results of Ivanov [5] , although the results we obtain could also be deduced 
from work of McCarthy [T3] and Birman, Lubotzky and McCarthy [T] . 

We say an element h of the mapping class group is pure, if there is a disjoint collection of simple 
closed curves o~(h) which are fixed individually by h, such that each complementary component of 
o~(h) is also preserved, and furthermore h acts on each complementary component as either a pseudo- 
Anosov element or the identity. If the collection of simple closed curves <r(/i) has the property that 
no simple closed curve with non-zero intersection number with cr{h) is fixed by h, then a(h) is called 
a canonical reduction system for h. If h is not pure, then we define the canonical reduction set 
a(h) to be the canonical reduction set of some pure power of h. Note that cr(/p/ _1 ) = fa(g). The 
canonical reduction system of a periodic or pseudo-Anosov element of the mapping class group is 
empty. Ivanov shows that given reducible elements of the mapping class group / and g, there is a 
product of / and g whose canonical reduction set is the intersection of the canonical reduction sets 
of / and g. 

Lemma 4.1. [5] Lemma 5.1] Let f and g be reducible elements of the mapping class group. Then 
there are positive numbers k and I, such that the canonical reduction set of f k g l is the intersection 
of the canonical reduction sets for f and g. 

A pseudo-Anosov element / acts on V M.JF with simple dynamics, there are a pair of fixed points, 
which we shall refer to as the stable fixed point Xt and the unstable fixed point AT. The stable 
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fixed point is attracting, i.e. every point in VA4J-\ XJ converges to Ay under iteration by /, and 
the unstable fixed point is repelling, i.e. every point in VM.T \ A^ converges to XJ under iteration 
by / . Ivanov [5] shows that a pair of pseudo-Anosov elements either have the same pair of fixed 
points, or they have disjoint fixed points. 

Lemma 4.2. [5J Lemma 5.11] If f and g are pseudo-Anosov elements, then either fix(f) = fix(g), 
or fix(f) Pi fix(g) is empty. 

We will also make use of the following observation from Ivanov [5]. 

Lemma 4.3. Chapter 11, exercise 3(a)] Let f be a pseudo-Anosov element, with fixed points A^. 
Let g be any element such that g(X~f) ^ XJ. Then f n g is pseudo-Anosov for all sufficiently large n. 

We now show that if a non-clcmcntary subgroup M is generated by a semi-group M + , then we 
can find a pair of independent pseudo-Anosov elements in M + . 

Lemma 4.4. Let M + be a semi-group in the mapping class group which generates a non- elementary 
group M . Then M + contains a pair of independent pseudo-Anosov elements. 

Proof. The group M contains two independent pseudo-Anosov elements. We now show that we can 
find a pair of independent pseudo-Anosov elements which in fact lie in M + . 

Suppose that M + does not contain a pair of independent pseudo-Anosov elements, but does con- 
tain at least one pseudo-Anosov element. By Lemma T4. 21 if M + contains pseudo-Anosov elements, 
they must all have common endpoints. Let / be a pseudo-Anosov element of M + , with fixed points 
Ay and XJ. We will now show that all periodic and reducible elements of AI + preserve the fixed 
point of /. 

Suppose g is a periodic element of M + . If g does not preserve the fixed points of /, then gfg^ 1 
is a pseudo-Anosov clement with distinct fixed points, and furthermore lies in M + , as g~ l may be 
written as a positive power of g. So all periodic elements of M + preserve the fixed points of /. 

Suppose g is a reducible element of M + . At least one of g or g 2 has the property that the image 
of Ay is not XJ , so possibly after replacing g by its square, we may assume that g has this property. 
Then by Lemma 14.31 the element f n g is pseudo-Anosov for all sufficiently large n. In particular 
gf n gg~ 1 = gf n is pseudo-Anosov, and lies in AI + . As we have assumed that M + contains no 
independent pseudo-Anosov elements, gf n must have the same fixed points as /, but this implies 
that g preserves the fixed points of /. 

We have shown that if M + contains pseudo-Anosov elements, then they must all have common 
fixed points, and all other elements of M + preserve the set of fixed points. But then the inverses of 
all elements of M + preserve the set of fixed points, so in in fact the entire group M preserves the 
pair of fixed points, which contradicts the fact that M is non-elementary. So we may assume that 
the semi-group M + contains no pseudo-Anosov elements. 

If there is a disjoint collection of essential simple closed curves in the surface E fixed by all 
elements of M + , then this disjoint collection of simple closed curves is also fixed by all inverses of 
elements in M + , and hence by the whole group M. As M is non-elementary it fixes no disjoint 
collection of simple closed curves in E, so this implies that the semi-group M + also fixes no disjoint 
collection of simple closed curves in the surface E. However, it could a priori be the case that all 
reducible elements of M + have common simple closed curves in their canonical reduction sets, but 
these are not preserved by elliptic elements of M + . 
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Let p be the intersection of the canonical reduction sets of all reducible elements of M + , i.e. 
p = f]{<y{g) | g 6 M + ,g reducible}. Lemma |4~T1 shows that given reducible elements of the mapping 
class group / and g, there is an element of the semi-group generated by / and g whose canonical 
reduction set is the intersection of the canonical reduction sets for / and g. This implies that there 
is a reducible element r G AI + with <r(r) = p. 

Suppose g £ M + does not preserve p. Let k be the smallest positive integer such that g k is pure. 
If g is periodic, then k is the order of g, and a(g) is empty. If g is reducible, then p C cr(g). By 
Lemma H.'Sl for all sufficiently large positive integers I, the element r l g k is pseudo-Anosov on each 
component of £ \ p on which either r or g k is pseudo-Anosov. In particular, o~(r l g k ) = p, and r l g k 
is not periodic. As we have assumed M + contains no pseudo-Anosov elements, r l g k must in fact be 
reducible. 

Consider conjugating r l g k by g, i.e. gf l g k g~ 1 = g.f l g k ~ 1 ■ This gives a reducible element of M + 
with canonical reduction set v(gf l g k ~ 1 ) = g<j(f l g k ) = g{p) ^ P- So we can construct an element 
of M + with canonical reduction set pHg(p), which is strictly smaller than p, which contradicts the 
fact that p is the intersection of all of the canonical reduction sets of all of the reducible elements 
of M + . So p is preserved by all elements of M + , and hence by the inverse of all elements of AI + , 
and therefore by the entire group M. But then the group M is reducible, not non-elementary, a 
contradiction. □ 



5 Exponential decay 

In this section we show that the harmonic measure of halfspaces H{l,x) decay exponentially with 
their distance from the origin. As the number of disjoint halfspaces H(l,x) grows exponentially in 
d(l, x), this is clearly true on average, but in this section we show that the measure of all halfspaces 
decays as L d ^ 1,x ^ , for some constant L < 1 which does not depend on the choice of halfspaces. We then 
show that there is a similar estimate for the convolutions measures p n , i.e. p n (H(l, x)) ^ QL^ 1 '*' , 
for some constant Q, and where L < 1 is the same constant as for v. 

We will need to use the fact that the group generated by the support of the random walk is 
non-elementary, and it will be convenient to use the fact that non-elementary subgroups have free 
subgroups that act on the complex of curves in a similar manner to Schottky groups acting on 
hyperbolic space. 

Definition 5.1. We say that a pair of elements a,b € G are a Schottky pair if there are disjoint 
halfspaces A + , A~ , B + and B~ in the relative space G, such that a{G \ A~) C A + , a^ 1 (G \ A + ) C 
A~,b(G \B~)cB+ and &- x (G \ B+) C B~. We will refer to a choice of halfspaces A ± , B ± , with 
the properties above, as Schottky halfspaces for the Schottky pair a, b. O 
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Figure 12: Halfspaces for a Schottky pair. 



This is illustrated in in Figure [12] above. The red lines correspond to axes for a and b. The 
subgroup generated by a Schottky pair is a free group with two generators, in which all non-identity 
elements are pscudo-Anosov. A non-elementary subgroup of the mapping class group contains a 
pair of pseudo-Anosov elements with distinct endpoints in VM.J- ", and these correspond to distinct 
points in the Gromov boundary of the complex of curves by work of Klarreich [lOj . Therefore there 
are integers p and q such that a p and b q are a Schottky pair. Furthermore, we may replace p and q 
with any larger integers. 

We first show that the harmonic measure of a Schottky halfspace is non-zero, for any Schottky 
pair contained in the support of [i. 

Proposition 5.2. Let A + be a Schottky halfspace, whose corresponding pseudo-Anosov element lies 
in the subgroup generated by the support of fi. Then v(A + ) > 0. 

Proof. Suppose v(A+) = 0. The measure v is /x„-invariant, i.e. v(X) — "^2 Pn{q)v{g~ 1 X), so if 
pL n {g) 7^ 0, then is(g~ 1 A + ) = 0. As a lies in the support of p,, there is an n such that p n {a) > 0. 
This implies that v(a~ k A + ) = for all k. However, the complement of the union of the a~ k A + is 
the unstable fixed point of a, which has measure zero as v is non-atomic. This means that the total 
measure v{VAAF) is zero, a contradiction. □ 

We now show that the harmonic measure of H(l, x) is bounded away from 1, at least for d(l, x) 
sufficiently large. 

Proposition 5.3. Let (i be a probability distribution on G whose support generates a non- elementary 
subgroup, and let v be the corresponding harmonic measure. Then there are constants Kg and e > 
such that h>(H(l, x)) ^ 1 — e, for all x with d(l,x) Kg. The constant Kg only depends on the 
support of p. The constant e depends on (i. 

Proof. Let M be the subgroup generated by the support of p. As this subgroup is non-elementary, 
it contains a Schottky pair, {a, b}. As a and 6 are pseudo-Anosov, the sequences a k and b k are 
quasi-geodesic, with distinct endpoints, so possibly after replacing a and b by positive powers of and 
b 3 , we may assume that {H (1, a* 1 ), H(l, b )} are a collection of Schottky halfspaces for {a, b}, and 
min{d(l, a), 3(1, b)} > 2d(l, [a, b]) + K 4 . By Proposition there is a Kg = 2d(l, [a, b]) + K 5 such 
that any halfspace H(l,x) with d(l,x) ^ Kg intersects at most one of the Schottky halfspaces. In 
particular this means that the half space H(l,x) is disjoint from at least three of the Schottky half 
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spaces, so we may choose e to be the minimum of the sum of any three of the harmonic measures of 
the four Schottky halfspaces. □ 



We now show that the harmonic measure of halfspaces H(l,x) decays exponentially in d{l,x) 
as x varies, at least for x with d(l,x) sufficiently large. 

Lemma 5.4. Let \i be a probability distribution on G whose support is relatively bounded and which 
generates a non- elementary subgroup, and let v be the corresponding harmonic measure. Then there 
are constants Kg and L < 1, such that if d(l,x) Kg then u(H(l,x)) ^ L d ^' x ' . Both Kg and L 
depend on fx. 

Proof. As the subgroup generated by the support of (i is non-elementary, the subgroup contains a 
Schottky pair, so by Proposition [5T31 there are constants K s and e such that v(H(l, x)) ^ 1 — e, for 
d(l,x) > K 8 . 

Before considering more general random walks, we explain the argument in the case of the 
nearest neighbour random walk on the Cayley graph. Consider a pair of nested halfspaces H{1,X2) 

and H(l, X2), where SG2) ^ 2K$ + K$, and x\ G [1,2:2] with d(l,xi) = ef (1,2:2) — K% — 2K-j. 
Given a halfspacc H(l,xi), we will refer to the set of points which are equal distance from both 1 
and x\ as the equidistant set for H(l,xi), which we shall denote by E(l,xi). We remark that a 
priori it may be the case that equidistant set E(l,Xi) = H(l,xi) PI H(x\, 1) has measure strictly 
larger than zero. Any sample path that converges into the limit set of H(l,X2j, must pass through 
the equidistant set of H(l,xi). So we can use the formula for conditional probability, conditioning 
on the location at which the sample path first hits the equidistant set of H(l,x\). By Proposition 
13.101 for any point a G E(l,xi), the halfspace H(1,X2) is contained in a halfspace H(a,b), with 
d(a,b) Kg, so the probability that you converge into the limit set of H(1,X2) from any point 
of E(l,xi) is at most 1 — e, so, in particular, the probability that you converge into the limit set 
of H{1,X2) is at most 1 — e. We may apply this argument to any nested sequence of halfspaces, 
each distance at least 2K% + K§ apart, so the probability that you converge into the limit set of a 
halfspace H(l,x) with d(l,x) > (2K S + K 6 )n is most (1 - e)( n-1 ). 

For more general probability distributions, we need to replace the condition of hitting the equidis- 
tant set of a halfspace, with the condition of hitting some suitably large neighbourhood of the 
equidistant set. 

Let Hi be a sequence of halfspaces H(l,Xi), where the Xi lie on a common geodesic starting at 
1, with d(l, Xi+i) ^ d(l, Xi) + 2Kg + K$ for each i. Consider a sample path which converges to a 
boundary point A in H^- Such a sample path must have infinitely many elements in H\. If such a 
sample path does not hit Hi \ Hk then it must first hit Hi inside H^, after a jump of distance at 
least k(2Ks + Kq). Pick k such that k(2K$ + Kq) is larger than the relative diameter of the support 
of fx. 

For a sample path w, let F(w) be the group element corresponding to location at which u> first 
hits Hi. This is well defined for sample paths which converge to A £ i?2- All sample paths which 
converge to A G Hk hit Hi \ Hk, so conditioning on F(ui) gives 

P(A(w) G Hk+i I A(w) G H k ) ^ J2 P ( F ( W ) = -9) vig^HkTi) 
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By Proposition ^. 101 for any point g in Hi there is a point b in -ff^+i such that -fffc+i C H(g, &), 
and d(g,b) ^ K$. Therefore, by Proposition 15.31 v{g~ 1 Hk+i) ^ 1 — e, so 

P(A(w) G i? fe +i | A(w) G i? fe ) «S 1 - e 

This implies that v{Hk+i) v(Hk){l — e), so by induction v{Hk+i) ^ (1 — e) i+1 . Therefore 
< L", for d(l,ar) > A' 9 , where we may choose L = (1 — e) 1 /^!) an d # g = fc(2A' 8 + K 6 ). □ 

The measure ^ is the weak limit of the measures [i n , so one may hope there is exponential decay 
for halfspaces for the /^-measures, at least for large n. In fact, we now show that there is an upper 
bound for fi n (H(l, x)) which decays exponentially in d(l,x), for all n. 

Lemma 5.5. There are constants K\g,L < 1 and Q, which only depend on 6 and fi, such that if 
H(l,x) is a halfspace with d(l,x) -KiOj then fi n (H(l,x)) ^ QL d ^' x \ 

Proof. We shall choose K\q = Kq + 2K$ + Kg. Let H(l, x) be a halfspace with d(l, x) ^ 2K$ + K§. 
Then, by Proposition 13.101 we may choose y £ with d(l,y) = d(l,x) — Kg — 2Kr, such that 

H(l,x) C H(l,y), and, furthermore, for any b £ H(l,x), there is a point a in H(y,l) such that 
the halfspace H(y, 1) is contained in the halfspace H(b,a), with d(a,b) Kg. By Proposition 15. 3[ 
the harmonic measure of H(y, 1) viewed from any b £ H(l,x) is at most 1 — e, i.e. if is the 
harmonic measure induced by a random walk starting at b instead of 1, then Vb{H{y 1 1)) $J 1 — e. 
At time n, the halfspace H(l,x) has measure (j, n (H(l,x)), so the proportion of these sample paths 
which converge into the limit set of H(y, 1) is at most 1 — e. Therefore at least e of these sample 
paths converge into the complement of the limit set of H(y, 1), which is contained in the limit set 
ofH(l,y), i.e. v(H(l,y)) ^ efi n (H(l,x)). This implies 

fi n (H(l,x)) \v{H^y)). 

We chose d(l,x) > K 6 + 2K$ + Kg, and d(l,y) = d(l, x) — Kg — 2Kr, so we may apply the exponential 
decay bounds from Lemma 15.41 to obtain 

H n (H(l,x))<U^- K »- 2K ?. 

Therefore we may choose Q to be lL~ K8 ~ 2Kr , and this depends on S and but not on x or n. 
Therefore we have shown that there is a constant Q such that /j, n (H(l,x)) ^ QL d ( 1,x >, for all n, as 
long as d(l, x) ^ K w = Kg + 2Kg + Kg. The constant L may be chosen to have the same value as 
the constant L from Lemma T5.4I □ 

We remark that for the nearest neighbour random walk, for small n and d(l, x) large, (i n (H(l, x)) 
will be zero until n is at least d(l,x), so fi n (H(l,x)) need not be monotonically decreasing in n. 

6 Linear progress 

We may now complete the proof of the main theorem. 
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Theorem 11.11 Let G be the mapping class group of an orientable surface of finite type, which is 
not a sphere with three or fewer punctures, and consider the random walk generated by a probability 
distribution [i, whose support is bounded in the relative metric and which generates a non- elementary 
subgroup of the mapping class group, and which has finite first moment. Then there is a constant 
£ > such that lkrin-^oo ^d(l,w n ) = £ almost surely. 

The fact that the limit —d(l,w n ) exists almost surely with respect to P follows from a standard 
application of Kingman's subadditive crgodic theorem. The main task of this section is to show that 
this limit is strictly greater than zero. We now state Kingman's subadditive ergodic theorem [5J, 
using the version from Wocss [21 Theorem 8.10]. 

Theorem 6.1. [HIE] Let (f2,P) be a probability space and U : fl — > a measure preserving 
transformation. Lf W n is a subadditive sequence of non-negative real-valued random variables on f2, 
that is, W n +k ^ W n + Wk o U n for all k,n 6 N, and W\ has finite first moment, then there is a 
U -invariant random variable Woo such that 

lim \W n = 

V-almost surely, and in expectation. 

We will choose (fi,P) to be the path space (G Z +,P) for the random walk determined by (G,/x). 
We will choose U to be the Bernoulli shift in the space of increments of the random walk, which is 
an ergodic measure-preserving transformation on (G Z +,P). Set W n = d(l,w n ). Then Wk ° U n — 
d(l, w~ l w n+ k), which is the relative distance between w n and w n +k, so the triangle inequality implies 
that W n is subadditive. We have assumed that [i has finite first moment with respect to the word 
metric on G, and as d(l,g) ^ \g\ for all g S G, this implies that \i also has finite first moment with 
respect to the relative metric on G. Theorem 16.11 then implies that lim,!-^ ^d(l,w n ) = £ exists 

almost surely, and in expectation, and in fact is constant almost surely. As d(l,w n ) for all n, 
this implies that £ ^ 0. It remains to show that t is strictly larger than zero. As the limit is constant 
almost surely, and the limit exists in expectation, it suffices to show that the limit iE(<i(l, w n )) is 
bounded away from zero. We will show that for m sufficiently large, the expected difference between 
d(l,w n ) and d(l,w n + m ) is bounded away from zero. 

Lemma 6.2. There is a constant N, such that if m > N, then E(d(l, w n -\- m ) — d(l,w n )) ^ 8 > 0. 
The constant N depends on fi, but is independent of m and n. 

This suffices to prove Theorem 1 1.1[ as we now explain. Consider 

E(eJ(l, W2n)) = E(d(l,w 2 N) ~ d(l,w N ) + d(l,w N ) - d(l,w )) 

This is equal to 

= E(d(l, w 2N ) - d(l, w N )) + E(d(l, w N )) > 26 

A similar argument shows E(<i(l, Wfciv)) ^ kS. As lhrin^oo —d(l,u) n ) = £ is constant almost surely, 
this shows £ ^ S/N > 0, as required. 
We now prove Lemma 16.21 
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Proof. Consider doing a random walk of length n, followed by one of length m. We can compute 
the expected change in relative length from time n to time n + m, which we shall denote A n>m . 

A n , TO = E(d(l,w n+m ) - d(l,w n )) 
We may rewrite this using the definition of expected value, giving 

A„, TO = ^2 Hm(y){d(l,xy) - d(l,x)). 

xeG yeG 

The fact that \i has finite first moment means that this sum is absolutely convergent, so we can 
swap the order of summation. Furthermore, as d(l,xy) = d(x~ 1 ,y), and d(l,x) — d(x~ 1 , 1) we can 
rewrite this as 

A n ,m = ^2 Vmiv) ^2 ^n(x){d(x~ 1 ,y) - d(x~ l , 1)) 

yEG xEG 

We may split the sum up into two parts, depending on whether or not d(l,y) > A, where A is a 
constant which depends on fi. We will choose A ^ K & + 2K 8 + Kg + 125, and furthermore, we 
will choose A to be sufficiently large such that rU < S/2Q, for all real numbers r ^ A, and this is 
possible as L < 1. Here the constants K e ,K 8 ,K 9 ,L and Q are those defined previously in Sections 
[3] and O though we emphasize that Q and L are the constants defined for the reflected random walk 
(G, Jl). We shall write B a for the ball of radius A about 1 in G. 

A„, m = ^2 Mm(y) ^ Vn{x){d(x~ l ,y) - d(x~ l ,l)) (6.1) 

yeS A x£G 

+ ^wE^)^ 1 " 1 ^)-^" 1 .!)) ( 6 - 2 ) 

y£G\B A zeG 

We now find lower bounds for lines (|6.1[) and f|6.2|l in turn. First, in line (|6.1j) . d(x^ 1 ,y) — 

c?(a; _1 ,l) ^ — d(l,y), by the triangle inequality. We have also assumed that d(l,y) < A. This 
implies the following lower bound for (|6.1[) . 

(EU) > (6.3) 

Wc now find a lower bound for line (|6.2j) . and we will simplify the notation by replacing x with x~ 1 . 
This makes no difference to the sum as we are summing over all elements of G. 

(GCID > J2 ^M^^'^fifav)-^ 1 )) 

y€G\B A 

Let N be the collection of x 6 G for which d(x,y) — d(x, 1) ^ 0, i.e. iV is the halfspace H(l,y). 
Let P be the region on which d(x,y) — d(x, 1) 3(5, so P is the halfspace H(y, 1; —35), using the 
notation from Proposition 13 . 131 This is illustrated below in Figure [TBI 
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Figure 13: Positive and negative regions. 



Let z be a point on [l,y] distance 125 from y. We now show that the region P contains the 
halfspace H(z, 1). The complement of P is contained in H (1, y; 35), and by Proposition 13.131 the 
complement of P is contained in H(l,z), for d(l,z) = d(l,y) — 125. Therefore P contains the 
halfspace H{z, 1). By the triangle inequality, d(x, y) — d(x, 1) ^ —d(l, y) on TV, and d(x, y) — d(x, 1) ^ 
35 on P. This gives the following lower bound for line (|6.2|) . 



> Ml/) [-d(l, y)Ji„(H(l, y)) + 36jl n {H(z, 1)) 

yeG\S A 

Here jl n is the reflected random walk measure, i.e. ]l n ( x ) = /• t 7iO c_1 )- Using that fact that 
Ji n (H(z, 1)) ^ 1 - ]l n (H(l, z)), we obtain 

(E2D > J! Hm(y)[-d(l,y)MH(l,y)) + 36(l-Ji n (H(l,z))) 

yeG\S A 

In line (|6.2j) we have assumed that d(l, y) ^ 2Kg + Kq + 125, so the halfspaces H(l, y) and H(l, z) 
both satisfy the hypotheses of Lemma 15.51 so we may estimate Jl n in terms of v. 



§M> Yl l*m(y)[35-d(l,y)Qv(H(l,y))-5Qv(H(z,l)) 

y£G\B A 

Here v is the harmonic measure corresponding to the reflected random walk defined by (G,J1), 
and Q is a constant which depends on Jl, but not on y or z. In line (|6.2j) we have assumed that 
d(l, y) ^ Kq + 125, so we may apply Lcmma l5.4[ which says that the harmonic measure of halfspaces 
decays exponentially. 

> E ^n(y)[35-Qd(l,y)L S ^ -Q5L S ^' 

y&G\B A 



Now using the facts that d(l,y) ^ 5, and we chose A such that rL r ^ 5/2Q for all r A, this 
implies that the term in square brackets in the line above is at least 25. Therefore 



> (1 - fx m (B A ))25. 



(6.4) 
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Recall that A n . m = (|6.ip + (|6.2j) . and now using the lower bounds from lines (|6.3p and f|6 .4[) . we 
obtain 

A„,™ > 2<5- Mm (i? J 4)(A + 25). 

The harmonic measure of a bounded set in G is zero, and so the random walk is transient on 
relatively bounded sets. In particular, for fixed A there is an N such that fx m (BA) < 8/(A + 26) for 
all m ^ N , and this will be our choice of N, which depends on pt. In particular, this means that 
A„ >m ^ <5, for all n, and for all m TV, and so A„ :m is bounded away from zero, as required. □ 

This completes the proof of lemma 16.21 and hence of Theorem 11.11 
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